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Abstract 

We derive the class of normalized generalized Gamma processes from Poisson-Kingman mod- 
els (Pitman, 2003) with tempered a-stable mixing distribution. Relying on this construction it 
can be shown that in Bayesian nonparametrics, results on quantities of statistical interest under 
those priors, like the analogous of the Blackwell-MacQueen prediction rules or the distribution 
of the number of distinct elements observed in a sample, arise as immediate consequences of 
Pitman's results. 

Keywords: Exchangeable random partitions; Exponential tilting; Inverse Gaussian density; 
Random probability measures; Tempered stable laws. 

1 Introduction 

In Lijoi, Mcna and Priinster (2005) the normalized inverse Gaussian (N-IG) process has been 
introduced as an alternative to the Dirichlet process to be used in Bayesian nonparametric mixture 
modeling. By mimicking Ferguson's (1973) famous construction of the Dirichlet process, the authors 
define a random discrete probability measure P, on a Polish space (S,S), whose finite dimensional 
distributions have the multivariate law of a vector of n independent r.v.'s with inverse Gaussian 
distribution divided by their sum. Even if the authors observe that a N-IG prior, with non- atomic 
parameter measure, belongs to the class of species sampling models (Pitman, 1996), and more 
exactly to Poisson-Kingman models (Pitman, 2003), they do not derive such process as an element 
of the previous families, but obtain, independently of Pitman's results, both the analogous of the 
Blackwell-MacQueen prediction rules and the distribution of the number of distinct values observed 
in a sample. 

Here we show how the larger class of normalized generalized Gamma processes (N-GG), already 
considered in James (2002), may be derived from Poisson-Kingman models for random partitions 
of the positive integers. In particular these processes arise as random discrete probability measures 
whose ranked atoms follow a Poisson-Kingman distribution derived from an a— stable law with 
mixing distribution the exponentially tilted version of the stable density. It follows that N-GG 
priors induce exchangeable Gibbs partitions of type a, (see Gnedin and Pitman, 2005), hence, even if 
Pitman (2003) is not directly concerned with applications in Bayesian nonparametrics, distributional 
results on quantities of statistical interest under those priors, arise as straightforward consequences 
of Pitman's results. 

The paper is organized as follows. In Section 2 we recall the definition of exponentially tilted 
Poisson-Kingman models derived from a stable law of index a. In Section 3 we exploit results in 
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Pitman (2003) to derive a general expression for corresponding exchangeable partition probability 
function (EPPF) and analogous of the Blackwcll-MacQuccn prediction rules. Finally, in Section 4, 
the distribution of the number of blocks and its asymptoyic behaviour are obtained from results in 
Pitman (2002, 2003) and Gnedin and Pitman (2005). 

2 Exponentially tilted a— stable Poisson-Kingman models 

It is well known that, given a law Q on the space V\ of decreasing sequences of positive numbers 
with sum 1 and a law H(-) on a Polish space (S, S), a random discrete probability measure P on S 
may be defined by P(-) = Y^iLi Pi^x t {-), for Xi iid <~ H(-) and (Pi) <~ Q. Generalizing Kingman's 
(1975) construction of the Dirichlet process as a Gamma process with independent increments di- 
vided by the sum, Jim Pitman — in a stimulating paper available on his web page since 1995 and 
published in 2003 — introduces a large class of random discrete probability measures deriving the 
law of the atoms, in decreasing order, from the ordered points of a homogeneous Poisson process 
on (0, oo) with given Levy density divided by their sum. 

Definition 1. [Pitman (2003; Def. 3)] Let Pi = (Ji/T) be a ranked discrete distribution derived 
from the ranked points of a Poisson process with Levy density p of random lcnghts J\ > J2 > • • • > 
by normalizing their lenghts by their sum which is T. The law Q on v\ of the sequence (Pi) will 
be called the Poisson-Kingman distribution with Levy density p, and denoted PK(p). Denote by 
PK(p\t) the regular conditional distribution of (Pi) given (T = t) constructed above. For a proba- 
bility distribution 7 on (0, 00), let 



be the distribution on the space V{. Call PK(p,j) the Poisson-Kingman distribution with Levy 
density p and mixing distribution 7. 

Remark 1. In James, Lijoi and Prunster (2005) a very large class of Normalized Random Mea- 
sure (NRMs) based on a more complex generalization of Kingman's construction, has been intro- 
duced and deeply studied in a Bayesian nonparametric perspective. As the same authors point 
out, Pitman's PK(p,j) models provide an important extension of homogenous NMRs. Those 
model contain, among the others, the two-parameter Poisson-Dirichlct distribution, PD(a,9), for 
< a < 1 and 8 > —a, which is the law of the ranked atoms of the well-known extension of the 
Dirichlet process introduced in Pitman and Yor (1997). Pitman (2003) shows this family corre- 
sponds to a Poisson-Kingman model derived from a stable law of index a with mixing distribution 
= r(e/a+i) ^~ 6 ' fa,s(t), for f a .s(t) the density of the stable law. (See also Pitman, 2002, for an 
exaustive account of this family of distributions on V\_.) 

In what follows we shall actually focus on models PK(p a ,j) where p a is the Levy density of a 
stable density of index a £ (0, 1). The reason lies in Theorem 8, Section 5.3, in Pitman (2003) and 
will be clarified later. 

First recall that, given a strictly positive r.v. T, with density fx and Laplace transform E(e~ XT ) = 
e — i/>(A) _ j°° e -At f T (t)dt, where, according to Levy-Kintchine formula, tp(X) — J °°(l — e~ Xs )p(ds) 
is the Laplace exponent, /t is uniquely determined by its unique Levy density p. Now, one of 
the basic operations that leads to the larger class of PK(p,-y) models introduced in Definition 1 
is given by exponential tilting (see Pitman, 2003, Sec. 4.2). Tilting a positive random variable 




(1) 



2 



T is performed by multiplying its density by a factor exp{ip(X) — Xt}, or its Levy density by a 
factor exp{— Xt}. For our purposes it is worth to recall the definition of tempered stable law, first 
introduced in Tweedie (1984), also called generalized Gamma distributions in Brix (1999), (see 
Barndorff-Nielsen and Shephard, 2001, for a comprehensive account). 

Definition 2. [Tempered a-stable law] Let f a .s(t), a € (0,1), S £ (0, oo), denote the proba- 
bility density function of a positive a-stable law with Laplace exponent ip a (X) — S(2X) a . Apart 
from a = \ and a = ^, for a general a £ (0, 1) explicit expressions of this density are known only 
in the form of series representations: 

f*,s(t) = ^^ 1/ "E(-l)^ 1 Bin(^a) r( ^ + 1) 2^ 1 (^ 1/a )-^- 1 ). 

i ... . i 

By the change of variable A = ^y-, 7 € [0, 00), exponential tilting f a ,s{t) with exp{5"/ — ? a t}, gives 

the density of a tempered stable law of parameters (a, 5, 7), 

f a ,s,j(t) = e 5 ^ 1/at f a ,s(t). 

Although this density has no explicit expression, corresponding Laplace exponent and Levy density 
are known to be as follows: 

^(A)=-<5 7 + 5(7« +2A) Q and p%(s) = 52° °_ ^-"e^A (2) 

Example 1. [Inverse Gaussian law] The class of tempered a-stable laws contains the inverse 
Gaussian law. In fact, for a = ^ the stable density has the following explicit form 

, . S _3 

/i/2,5(*) = -j=< 2 e »*, 

V 27T 



and corresponding Laplace exponent ip(X) — Sy/2X. By exponential tilting with A = \ , the density 
of a tempered ^—stable law results 

/ Si7 (t) = ^e^t-l exp |-1 (,5V 1 + 7 2 i) } , (3) 

for 5 € (0,oo) and 7 € [0, 00), which is well-known to be the density of an inverse Gaussian (5, 7) 
law (see e.g. Seshadri, 1993). Corresponding Laplace exponent and Levy density easily follow from 
©: 

V'I(A) = -<57 + <5(7 2 + 2A)* and p\{s) = —- 8 -ie-^. 

2 2 V27T 



It is well known that normalized generalized Gamma priors, as introduced in Bayesian nonpara- 
metric context, (see James, 2002) corresponds to random discrete probability measures P(-) = 
SSi Pi^x { ) whose ranked atoms (Pi) follow a PK(p e a ) distribution. Nevertheless in Section 4.2 
(cfr. eq. (46)) Pitman shows that if p e is the tilted version of the Levy density of T, a model PK(p e ) 
is equivalent to a model PK(p,j e ) where p is the Levy density of T and the mixing distribution, 
7 e , is the tilted version of the density of T. This implies, for example, that the normalized inverse 
Gaussian prior of Lijoi, Mena and Prunster (2005) corresponds to a random discrete distribution 
whose ranked atoms have PK(pi,^ft) distribution. Relying on the previous considerations we are 
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now in a position to introduce the following definition: 

Definition 3. Let PK(p a ) be a Poisson-Kingman model derived from an a-stable law, and 
let 7„(i), for a 6 (0,1) denote the density of a tempered a— stable law. We call the family of 
distributions PK(p a ,j^) = P^{p e a ) on exponentially tilted a— stable Poisson-Kingman models. 

3 EPPF and predictive distributions 

In this section we derive distributional results for quantities of statistical interest in Bayesian non- 
parametric modeling under normalized generalized Gamma priors, exploiting Pitman's results for 
Poisson-Kingman models. First recall that, from Kingman's theory of exchangeable random parti- 
tions (Kingman, 1978), sampling from a random discrete distribution P, induces a random partition 
LT of the positive integers N, by the exchangeable equivalence relation i rj j <t=> Xj = Xj, that is to 
say two positive integers i and j belong to the same block of II if and only if Xi = Xj, where X,;|P 
are iid ~ P. It follows that, for each restriction IT„ = {A\, . . . , A k } of II to [n] = {1, ... , n}, and for 
each n = 1,2, . . ., Pr(LT n = {A l: . . . ,A k }) = p(m, . . .,n k ), where, for j = 1, 2, . . . , k, rij = \Aj\ > 1 
and Y]j—i n>j = n, for some non-negative symmetric function p of finite sequences of positive in- 
tegers called the exchangeable partition probability function (EPPF) determined by LT. In Hansen 
and Pitman (2000) it is shown that an infinite exchangeable sequence (X n ) admits prediction rules 
of the form 

Pr(x n+1 e .\Xi, . . .,x n ) = X>i,»<^;(-) + 1nH{-) (4) 

3=1 

where X*, for 1 < j • < K n are distinct values, in order of appearance, in [X\, . . . , X n ), pj^ n 
and q n are non- negative product measurable functions of (X\, . . . ,X n ), and H(-) is a non atomic 
probability measure on S, if and only if pj, n — p(n 3+ )/p(n) and q n = p(n k+1 )/p(n), where 
p(n) := p(m, . . . ,n k ), p(n J+ ) := p(ni, . . . , rij + 1, . . . , n k ), and p(n k+1 ) := p(m, . . . ,n k , 1)- Ex- 
changeable sequences admitting predictive distributions of this form are termed species sampling 
sequences, and their directing measures P are called species sampling models (Pitman, 1996). 

By construction, random discrete distributions derived by PK(p, 7) models belong to this class, 
therefore, to obtain from (fj]) general expressions for the predictive distributions, one just need to 
know the EPPFs. Indeed Pitman (2003) provides a thorough characterization of the laws PK(p) on 
V\ via their corresponding EPPFs. Specifically, according to Corollary 6, Sec. 3, for some random 
partition LT n = {A\, . . . , A k } of [n] — {1, . . . ,n}, with block sizes \Ai\ — rii for i = 1, . . . , k < n, the 
EPPF associated with each PK{p) is given by 

(_-\\n— k r<x> fe 

p K (n u ...,n k ) :=i— '— / \n-i e -ip(X)dX J~J ^ (A)dA, (5) 
T ( n ) Jo fJl 

where ip(X) is the Laplace exponent determined by p(-) and, for m = 1, . . . ,n, ip m (X) := -j^-ip(X). 
It follows that, having at hand the equivalence stated in Definition 3., the EPPF of a PK(p a ,j^) 
model can be easily deduced from ([5]) by substituting in the Laplace exponent of the tempered 
a— stable law given in ([2]). 

Proposition 1. Let j^,(t) denote the density of a tempered a-stable law, for a € (0,1), then 
the exchangeable partition probability function induced by an exchangeable sequence (X n ) whose 
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directing measure P has ranked atoms following a PK(p a ,^) distribution, results 

p(nu...,n k ) = ^ Fr ]l(l-a) nj ^l X^ - — dX, (6) 

where Tij — l | stands for the usual notation of rising factorials (x) n -\ = x(x + l)(x + 2) ■ ■ ■ (x + n— 1). 

Proof. By equation the Laplace exponent of 7„ is given by ij)%(X) = — £7 + 5(-fi + 2X) a , 
hence 

r jm m—l 
MX) := = ^"(7 = + 2A)- m (-l)" l - 1 a J[(a- i). 

i=l 

By substitution in ([5]) 



P (n u ...,n k )= UfL. [ X ^ e s^i + 2xr ]J S2 

r(n) 7 jLj 



(7- + 2A)"j- q 



]_]_(!- a) nj _i T / A 1 — — - dA, 



which reduces to 



r(n) JLlv ^ ^ nJ=i(7*+2A)%-° 

and the result follows. □ 

Relying on the previous result, the general expression for predictive distributions induced by expo- 
nentially tilted a-stable Poisson-Kingman models, easily follows. 

Corollary 1. An exchangeable sequence (X n ) whose directing measure P has ranked atoms (Pi) 
with distribution PK(p a ,^), has predictive distributions of the form (4) for 

p,>(nj = (rij - a) and g«(nj = ad (7) 

n r} n ,k ' n r] n> k 



where 



i*.* = I A " _1 r^ — ^ 1 dX - ( 8 ) 



n 



Example 2. [Normalized Inverse Gaussian process] Specializing ([6|) for a = 1/2 and 7 = 1 
one obtains 

Sxknn-k * /1\ /•<» -5(l+2A)i 

r(n) ^V2y„._ 1T y (1 + 2A)""' 

With some manipulations, and having at hand the definition of incomplete Gamma function, i.e. 
r(a; a;) = J t a ~ 1 e~ t dt, it is easy to see that (0 reduces to formula (Al) in Appendix A. 4 of Lijoi, 
Mena and Priinster (2005), and results in Proposition 3 arise by specializing ([7|) and Q for a = 1/2. 
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It is worth to notice that the EPPF in ^ defines an infinite Gibbs partition of type a, (see Gnedin 
and Pitman, 2005), namely, for all 1 < k < n, and all compositions (ni, . . . ,n k ) of n, and for each 
n > 1, it has Gibbs product form i.e. 

k 

p{n 1 ,...,n k ) = V n , k J[W nj , (10) 

i=i 

for VF = (Wj) non- negative weights and 

e s ^5 k a k 2 n f°° , e -<5(7«+2A)" 

Vn.fc = ^7"^; / A — — — — CM, 

r(n) 7 ( 7 « + 2A)"- fett 

and it is of type a, i.e. W„. = (1 — a)„,_ij. This is in line with a result stated in Pitman 
(2003) (cfr. Th. 8) and proved in Gnedin and Pitman (2005), according to which: a) an infinite 
exchangeable partitions II of N has EPPF in Gibbs form (|T0)) if and only if W n = (1 — a) n --if, for 
some a S (— oo, 1) and 6J for a E (0, 1) this characterizes EPPFs induced by PK(p a ,-f) partition 
models, (cfr. Th. 12, item (iii) in Gnedin and Pitman, 2005). 



4 Distribution of the number of blocks 

In Bayesian nonparametric mixture modeling context it is usual to give to the distribution of the 
number of blocks in the partition induced by the prior, the interpretation of a prior distribution for 
the number of components in the mixture model. Antoniak (1974) obtains the distribution of the 
number of component induced by a Dirichlet prior from the Ewens sampling formula, an equivalent 
of the EPPF for the Dirichlet case. From Gnedin and Pitman (2005) (cfr. eq. 10), an EPPF in 
Gibbs form induces the following distribution of the number of blocks K n , by summation over all 
partitions {A\, . . . ,Ak\ of [n] with k blocks, and \Aj\ = nj, j = 1, . . . , k, 

Pr{K n =k) = V n , k B nik (W), (11) 

where B n ^ k (W) = A k } n^=i W|a,-|j is known as the partial Bell polynomial in the variables 

W. A special form of (fTTjl for EPPFs in Gibbs form of type a can be easily derived to get the 
following result that don't need to be proved. 



Proposition 2. A sample (Xi, . . . , X n ) from a PK(p a ,j^) model induces the following distri- 
bution of the number of blocks K n : 

Pr(K n = k) = V ntk S a (n,k) (12) 

for 

e s ^S k a k 2 n f 00 j e -5(7«+2A)° 

V n .k — / A - — ; — CM, 



r(n) Jo (ji + 2A)«- fe « ' 



nl k 1 

S a {n,k) := B ntk ((l - a)._i T ) = — 2J IT ~\ ^ ~ a )«j-it 

(ni,...,n h ) 3=1 3 ' 
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where the sum extends over the space of all compositions (m, . . . , n^) of n. S a (n, k) is known as the 
generalized Stirling number of the first kind, and it has the following explicit formula ( cfr. Pitman, 
2002). 

^(n,fc) = ^?B- 1 ) J '(-)(-^W 



Example 3. The distribution of the number of components in a hierarchical Bayesian nonpara- 
mctric mixture model under a N-IG prior obtained in Lijoi, Mena and Priinstcr (2005), Proposition 
4., easily follows from (fT2"|) for a = 1/2. In fact, from formula (127) in Pitman (2003), 



5 <5 fc /2n-fc-l\ e - 5 d+2A)2 



hence, for 7 = 1, 

r(A:)2 n k \ n-1 J J Q (1 + 2A)"~ 5 

which is easy to show that reduces to equation (9) in Lijoi, Mena and Prunster (2005) by means of 
incomplete Gamma function substitutions. 

In Section 6.1 of Pitman (2003) the concept of a-diversity has been introduced for a random parti- 
tion IT with ranked frequencies (Pi) following a Poisson-Kingman model derived from an a-stablc 
law. 

Definition 4. [Pitman, 2003] An exchangeable partition II of the positive integers N has a- 
diversity S a , if and only if there exists a random variable S a , with < S a < oo a.s., such that, if 
K n is the number of blocks in the restriction of II to [n] , then 

K n a.s. „ /.„i 

> S a as n — > oo. (13) 

In Proposition 13, item (i), Pitman states that if II is a PK(p a , 7) partition of N for some a € (0, 1), 
then S a — T~ a , where T has distribution 7. Hence, by an elementary transformation, the asymp- 
totic distribution of K n /n a for a PK(p a ,j^) partition, easily follows: 

Proposition 3. Let K n be the number of blocks in a random partition of [n] induced by sam- 
pling from P, whose ranked atoms follow an exponentially tilted PK(p a ,j^) model. Since T has 
density 

tS(*) = ex P jfry - \i« * j f a ,d(t), 

for f a .s(t) the density of an a stable law, then 

a.s. ~ 

► S a as n — > 00 

n a 
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where S a has density 



Example 4. [Normalized inverse Gaussian process] For a — 1/2, 7 = 1 and exploiting the explicit 
form of the 1/2-stable density, l[T4"|) results: 

/ Sl/2 ( S ) = ^exp|5-i(5V + s - 2 )|. 
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